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3 <110> APPLICANT: KEASLING, JAY 

4 MARTIN , VINCENT 

5 PITERA, DOUGLAS 

6 KIM, SEON-WON 

7 WITHERS III, SYDNOR T. 

8 YOSHIKUNI, YASUO 

9 NEWMAN , JACK 

10 KHLEBNIKOV , ARTEM VALENTINOVICH 

12 <120> TITLE OF INVENTION: BIOSYNTHESIS OF ISOPENTENYL PYROPHOSPHATE 

14 <130> FILE REFERENCE: 2000-0007 

16 <140> CURRENT APPLICATION NUMBER: 10/006,909 

C--> 17 <141> CURRENT FILING DATE: 2002-04-02 

19 <160> NUMBER OF SEQ ID NOS : 13 

21 <170> SOFTWARE: Patentln Ver . 2.1 

23 <210> SEQ ID NO: 1 

24 <211> LENGTH: 1185 

25 <212> TYPE: DNA 

26 <213> ORGANISM: Artificial Sequence 

28 <220> FEATURE: 

29 <223> OTHER INFORMATION: Description of Artificial Sequence: Synthetic 

30 Acetoacetyl-CoA thiolase nucleotide sequence 
32 <400> SEQUENCE: 1 

3 3 atgaaaaatt gtgtcatcgt cagtgcggta cgtactgcta tcggtagttt taacggttca 60 

34 ctcgcttcca ccagcgccat cgacctgggg gcgacagtaa ttaaagccgc cattgaacgt 120 

35 gcaaaaatcg attcacaaca cgttgatgaa gtgattatgg gtaacgtgtt acaagccggg 180 

36 ctggggcaaa atccggcgcg tcaggcactg ttaaaaagcg ggctggcaga aacggtgtgc 240 

37 ggattcacgg tcaataaagt atgtggttcg ggtcttaaaa gtgtggcgct tgccgcccag 300 

38 gccattcagg caggtcaggc gcagagcatt gtggcggggg gtatggaaaa tatgagttta 360 

39 gccccctact tactcgatgc aaaagcacgc tctggttatc gtcttggaga cggacaggtt 420 

40 tatgacgtaa tcctgcgcga tggcctgatg tgcgccaccc atggttatca tatggggatt 480 

41 accgccgaaa acgtggctaa agagtacgga attacccgtg aaatgcagga tgaactggcg 540 

42 ctacattcac agcgtaaagc ggcagccgca attgagtccg gtgcttttac agccgaaatc 600 

43 gtcccggtaa atgttgtcac tcgaaagaaa accttcgtct tcagtcaaga cgaattcccg 660 

44 aaagcgaatt caacggctga agcgttaggt gcattgcgcc cggccttcga taaagcagga 720 

45 acagtcaccg ctgggaacgc gtctggtatt aacgacggtg ctgccgctct ggtgattatg 780 

46 gaagaatctg cggcgctggc agcaggcctt acccccctgg ctcgcattaa aagttatgcc 840 

47 agcggtggcg tgccccccgc attgatgggt atggggccag tacctgccac gcaaaaagcg 900 

48 ttacaactgg cggggctgca actggcggat attgatctca ttgaggctaa tgaagcattt 960 

49 gctgcacagt tccttgccgt tgggaaaaac ctgggctttg attctgagaa agtgaatgtc 1020 

50 aacggcgggg ccatcgcgct cgggcatcct atcggtgcca gtggtgctcg tattctggtc 1080 

51 acactattac atgccatgca ggcacgcgat aaaacgctgg ggctggcaac actgtgcatt 1140 

52 ggcggcggtc agggaattgc gatggtgatt gaacggttga attaa 1185 
55 <210> SEQ ID NO: 2 



ENTERED 
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56 <211> LENGTH: 1476 

57 <212> TYPE: DNA 

58 <213> ORGANISM: Artificial Sequence 

60 <220> FEATURE: 

61 <223> OTHER INFORMATION: Description of Artificial Sequence: Synthetic 

62 HMG-CoA synthase 

64 <400> SEQUENCE: 2 

65 atgaaactct caactaaact 

66 caacaacaat tacacaatac 

67 gaacaaaaaa ccagacctca 

68 caatgtgtca accaatctga 

69 attggtctgg gccaaaccaa 

70 tccctaactg ttttgtctaa 

71 agattagaag tcggtactga 

72 atgcaattgt ttggtgaaaa 

73 ggtggtacca acgcgttgtt 

74 agagacgcca ttgtagtttg 

75 accggtggtg ccggtactgt 

76 tctgtaagag cttcttacat 

77 gaatatcctt acgtcgatgg 

78 gtttacaaga gttattccaa 

79 tcggatgctt tgaacgtttt 

80 aaattggtca caaaatcata 

81 ttgttcccag aagttgacgc 

82 aagaacattg aaaaaacttt 

83 caatctttga ttgttccaac 

84 tttgcatctc tattaaacta 
8 5 ttttcttacg gttccggttt 

86 caacatatta tcaaggaatt 

87 ccaaaggatt acgaagctgc 

88 aaacctcaag gttccattga 

89 gacaaattta gaagatctta 

92 <210> SEQ ID NO: 3 

93 <211> LENGTH: 1509 

94 <212> TYPE: DNA 

95 <213> ORGANISM: Artificial Sequence 

97 <220> FEATURE: 

98 <223> OTHER INFORMATION: Description of Artificial Sequence: Synthetic 

99 HMG-CoA reductase nucleotide sequence 

101 <400> SEQUENCE: 3 

102 atggttttaa ccaataaaac agtcatttct ggatcgaaag tcaaaagttt atcatctgcg 60 

103 caatcgagct catcaggacc ttcatcatct agtgaggaag atgattcccg cgatattgaa 120 

104 agcttggata agaaaatacg tcctttagaa gaattagaag cattattaag tagtggaaat 180 

105 acaaaacaat tgaagaacaa agaggtcgct gccttggtta ttcacggtaa gttacctttg 240 

106 tacgctttgg agaaaaaatt aggtgatact acgagagcgg ttgcggtacg taggaaggct 300 

107 ctttcaattt tggcagaagc tcctgtatta gcatctgatc gtttaccata taaaaattat 360 

108 gactacgacc gcgtatttgg cgcttgttgt gaaaatgtta taggttacat gcctttgccc 420 

109 gttggtgtta taggcccctt ggttatcgat ggtacatctt atcatatacc aatggcaact 480 

110 acagagggtt gtttggtagc ttctgccatg cgtggctgta aggcaatcaa tgctggcggt 540 



e nucleotide sequence 



ttgttggtgt 
aaacttgcaa 
aaatgtcggt 
gctagagaaa 
catgtctttt 
gttgatcaag 
aactctgatt 
cactgacgtc 
caactctttg 
cggtgatatt 
tgctatgtgg 
ggaacacgcc 
tcatttttca 
gaaggctatt 
gaaatatttc 
cggtagatta 
cgaattagct 
tgttaatgtt 
aaacacaggt 
tgttggatct 
agctgcatct 
agatattact 
catcgaattg 
gcatttgcaa 
cgatgttaaa 



ggtattaaag 
atgactgaac 
attaaaggta 
tttgatggcg 
gtcaatgaca 
agttacaaca 
gacaagtcca 
gaaggtattg 
aactggattg 
gccatctacg 
atcggtcctg 
tacgattttt 
ttaacttgtt 
tctaaagggt 
gactacaacg 
ctatataacg 
actcgcgatt 
gctaagccat 
aacatgtaca 
gacgacttac 
ctatattctt 
aacaaattag 
agagaaaatg 
agtggtgttt 
aaataa 



gaagacttag 
taaaaaaaca 
tccaaattta 
tttctcaagg 
gagaagatat 
tcgacaccaa 
agtctgtcaa 
acacgcttaa 
aatctaacgc 
ataagggtgc 
atgctccaat 
acaagccaga 
acgtcaaggc 
tggttagcga 
ttttccatgt 
atttcagagc 
atgacgaatc 
tccacaaaga 
ccgcatctgt 
aaggcaagcg 
gcaaaattgt 
ccaagagaat 
cccatttgaa 
actacttgac 



gccgcaaaag 
aaagaccgct 
catcccaact 
taaatacaca 
ctactcgatg 
caaaattggt 
gtctgtcttg 
tgcctgttac 
atgggatggt 
cgcaagacca 
tgtatttgac 
tttcaccagc 
tcttgatcaa 
tcccgctggt 
tccaacctgt 
caatcctcaa 
tttaaccgat 
gagagttgcc 
ttatgccgcc 
tgttggttta 
tggtgacgtc 
caccgaaact 
gaagaacttc 
caacatcgat 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1476 
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gcccagtagt 
cagaagaggg 
tgcaacatat 
ctggtgacgc 
tggtagaaga 
ccgacaaaaa 
aagctactat 
ttgagttgaa 
ttaacgcaca 
cacaaaatgt 
gaatttccgt 
aaccacaagg 
gtaccaacgc 
ccttatgtgc 
aacctgctga 
atgggtccgt 



ccgtttccca 
acaaaacgca 
tcaaacttgt 
aatgggtatg 
gtatggctgg 
accagctgcc 
tcctggtgat 
cattgctaag 
tgcagctaat 
tgaaagttcc 
atccatgcca 
tgccatgttg 
acgtcaatta 
tgccctagca 
accaacaaaa 
cacctgcatt 



600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1509 



111 ggtgcaacaa ctgttttaac taaggatggt atgacaagag 

112 actttgaaaa gatctggtgc ctgtaagata tggttagact 

113 attaaaaaag cttttaactc tacatcaaga tttgcacgtc 

114 ctagcaggag atttactctt catgagattt agaacaacta 

115 aatatgattt ctaaaggtgt cgaatactca ttaaagcaaa 

116 gaagatatgg aggttgtctc cgtttctggt aactactgta 

117 atcaactgga tcgaaggtcg tggtaagagt gtcgtcgcag 

118 gttgtcagaa aagtgttaaa aagtgatgtt tccgcattgg 

119 aatttggttg gatctgcaat ggctgggtct gttggtggat 

120 ttagtgacag ctgttttctt ggcattagga caagatcctg 

121 aactgtataa cattgatgaa agaagtggac ggtgatttga 

122 tccatcgaag taggtaccat cggtggtggt actgttctag 

123 gacttattag gtgtaagagg cccgcatgct accgctcctg 

124 gcaagaatag ttgcctgtgc cgtcttggca ggtgaattat 

125 gccggccatt tggttcaaag tcatatgacc cacaacagga 

126 cctaacaatt tggacgccac tgatataaat cgtttgaaag 

127 aaatcctaa 

130 <210> SEQ ID NO: 4 

131 <211> LENGTH: 1332 

132 <212> TYPE: DNA 

133 <213> ORGANISM: Artificial Sequence 

135 <220> FEATURE: 

136 <223> OTHER INFORMATION: Description of Artificial Sequence: Synthetic 

137 Mevalonate kinase nucleotide sequence 

139 <400> SEQUENCE: 4 

140 atgtcattac cgttcttaac ttctgcaccg ggaaaggtta 

141 gctgtgtaca acaagcctgc cgtcgctgct agtgtgtctg 

142 ataagcgagt catctgcacc agatactatt gaattggact 

143 cataagtggt ccatcaatga tttcaatgcc atcaccgagg 

144 ttggccaagg ctcaacaagc caccgatggc ttgtctcagg 

145 ccgttgttag ctcaactatc cgaatccttc cactaccatg 

146 atgtttgttt gcctatgccc ccatgccaag aatattaagt 

147 cccatcggtg ctgggttggg ctcaagcgcc tctatttctg 

148 gcctacttgg gggggttaat aggatctaat gacttggaaa 

149 catatagtga atcaatgggc cttcataggt gaaaagtgta 

150 atagataacg ctgtggccac ttatggtaat gccctgctat 

151 ggaacaataa acacaaacaa ttttaagttc ttagatgatt 

152 ctaacctata ctagaattcc aaggtctaca aaagatcttg 

153 gtcaccgaga aatttcctga agttatgaag ccaattctag 

154 ctacaaggct tagagatcat gactaagtta agtaaatgta 

155 gtagaaacta ataatgaact gtatgaacaa ctattggaat 

156 ctgcttgtct caatcggtgt ttctcatcct ggattagaac 

157 gatttgagaa ttggctccac aaaacttacc ggtgctggtg 

158 ttgttacgaa gagacattac tcaagagcaa attgacagct 

159 gattttagtt acgagacatt tgaaacagac ttgggtggga 

160 gcaaaaaatt tgaataaaga tcttaaaatc aaatccctag 



161 aaaactacca caaagcaaca aattgacgat ctattattgc 

162 tggacttcat ag 

165 <210> SEQ ID NO: 5 



ttatttttgg 
cgttgagaac 
tcccggacat 
atcaagtaaa 
aactcgttag 
cagcgttttg 
tttctttaaa 
tatcactggc 
agctgtcaga 
ttcacggtac 
ttgaaaaaga 
tcccagccat 
ttgctcgcgt 
atgccatggg 
aaggcaccga 
tgataagaat 
ttattaaaaa 
gcggcggttg 
tcaaaaagaa 
ctggctgctg 
tattccaatt 
caggaaacac 



tgaacactct 
ctacctgcta 
tagctttaat 
ctcccaaaaa 
tcttttggat 
tttcctgtat 
gtctacttta 
cttagctatg 
aaacgataag 
cccttcagga 
ctcacataat 
tccaatgatc 
tcgtgtgttg 
tgaatgtgcc 
tgacgaggct 
aaatcatgga 
tctgagcgat 
ctctttgact 
attgcaagat 
tttgttaagc 
atttgaaaat 
gaatttacca 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1332 
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166 <211> LENGTH: 1356 

167 <212> TYPE: DNA 

168 <213> ORGANISM: Artificial Sequence 

170 <220> FEATURE: 

171 <223> OTHER INFORMATION: Description of Artificial Sequence: Synthetic 

172 Phosphomevalonate kinase nucleotide sequence 
174 <400> SEQUENCE: 5 

17 5 atgtcagagt tgagagcctt cagtgcccca gggaaagcgt tactagctgg tggatattta 60 

176 gttttagata caaaatatga agcatttgta gtcggattat cggcaagaat gcatgctgta 120 

177 gcccatcctt acggttcatt gcaagggtct gataagtttg aagtgcgtgt gaaaagtaaa 180 
17 8 caatttaaag atggggagtg gctgtaccat ataagtccta aaagtggctt cattcctgtt 240 

179 tcgataggcg gatctaagaa ccctttcatt gaaaaagtta tcgctaacgt atttagctac 300 

180 tttaaaccta acatggacga ctactgcaat agaaacttgt tcgttattga tattttctct 360 

181 gatgatgcct accattctca ggaggatagc gttaccgaac atcgtggcaa cagaagattg 420 

182 agttttcatt cgcacagaat tgaagaagtt cccaaaacag ggctgggctc ctcggcaggt 480 

183 ttagtcacag ttttaactac agctttggcc tccttttttg tatcggacct ggaaaataat 540 

184 gtagacaaat atagagaagt tattcataat ttagcacaag ttgctcattg tcaagctcag 600 

185 ggtaaaattg gaagcgggtt tgatgtagcg gcggcagcat atggatctat cagatataga 660 

186 agattcccac ccgcattaat ctctaatttg ccagatattg gaagtgctac ttacggcagt 720 

187 aaactggcgc atttggttga tgaagaagac tggaatatta cgattaaaag taaccattta 780 

188 ccttcgggat taactttatg gatgggcgat attaagaatg gttcagaaac agtaaaactg 840 

189 gtccagaagg taaaaaattg gtatgattcg catatgccag aaagcttgaa aatatataca 900 

190 gaactcgatc atgcaaattc tagatttatg gatggactat ctaaactaga tcgcttacac 960 

191 gagactcatg acgattacag cgatcagata tttgagtctc ttgagaggaa tgactgtacc 1020 

192 tgtcaaaagt atcctgaaat cacagaagtt agagatgcag ttgccacaat tagacgttcc 1080 

193 tttagaaaaa taactaaaga atctggtgcc gatatcgaac ctcccgtaca aactagctta 1140 

194 ttggatgatt gccagacctt aaaaggagtt cttacttgct taatacctgg tgctggtggt 1200 

195 tatgacgcca ttgcagtgat tactaagcaa gatgttgatc ttagggctca aaccgctaat 1260 

196 gacaaaagat tttctaaggt tcaatggctg gatgtaactc aggctgactg gggtgttagg 1320 

197 aaagaaaaag atccggaaac ttatcttgat aaatag 1356 

200 <210> SEQ ID NO: 6 

201 <211> LENGTH: 1191 

202 <212> TYPE: DNA 

203 <213> ORGANISM: Artificial Sequence 

205 <220> FEATURE: 

206 <223> OTHER INFORMATION: Description of Artificial Sequence: Synthetic 
207 
208 



Mevalonate pyrophosphate decarboxylase nucleotide 
sequence 

210 <400> SEQUENCE: 6 

211 atgaccgttt acacagcatc cgttaccgca cccgtcaaca tcgcaaccct taagtattgg 60 

212 gggaaaaggg acacgaagtt gaatctgccc accaattcgt ccatatcagt gactttatcg 120 

213 caagatgacc tcagaacgtt gacctctgcg gctactgcac ctgagtttga acgcgacact 180 

214 ttgtggttaa atggagaacc acacagcatc gacaatgaaa gaactcaaaa ttgtctgcgc 240 

215 gacctacgcc aattaagaaa ggaaatggaa tcgaaggacg cctcattgcc cacattatct 300 

216 caatggaaac tccacattgt ctccgaaaat aactttccta cagcagctgg tttagcttcc 360 

217 tccgctgctg gctttgctgc attggtctct gcaattgcta agttatacca attaccacag 420 

218 tcaacttcag aaatatctag aatagcaaga aaggggtctg gttcagcttg tagatcgttg 480 

219 tttggcggat acgtggcctg ggaaatggga aaagctgaag atggtcatga ttccatggca 540 

220 gtacaaatcg cagacagctc tgactggcct cagatgaaag cttgtgtcct agttgtcagc 600 
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221 
222 
223 
224 
225 
226 
227 
228 
229 
230 
233 
234 
235 
236 
238 
239 
240 
242 
243 
244 
245 
246 
247 
248 
249 
250 
251 
252 
253 
254 
255 
256 
257 
258 
259 
260 
261 
262 
263 
264 
265 
266 
267 
268 
269 
270 
271 
272 
273 



1080 
1140 
1191 



gatattaaaa aggatgtgag ttccactcag ggtatgcaat tgaccgtggc aacctccgaa 660 
ctatttaaag aaagaattga acatgtcgta ccaaagagat ttgaagtcat gcgtaaagcc 720 
attgttgaaa aagatttcgc cacctttgca aaggaaacaa tgatggattc caactctttc 7 80 
catgccacat gtttggactc tttccctcca atattctaca tgaatgacac ttccaagcgt 840 
atcatcagtt ggtgccacac cattaatcag ttttacqgaq aaacaatcgt tgcatacacg 900 
tttgatgcag gtccaaatgc tgtgttgtac tacttagctg aaaatgagtc gaaactcttt 960 
gcatttatct ataaattgtt tggctctgtt cctggatggg acaagaaatt tactactgag 1020 
cagcttgagg ctttcaacca tcaatttgaa tcatctaact ttactgcacg tgaattggat 
cttgagttgc aaaaggatgt tgccagagtg attttaactc aagtcggttc aggcccacaa 
gaaacaaacg aatctttgat tgacgcaaag actggtctac caaaggaata a 
<210> SEQ ID NO: 7 
<211> LENGTH: 9253 
<212> TYPE: DNA 

<213> ORGANISM: Artificial Sequence 
<220> FEATURE: 

<223> OTHER INFORMATION: Description of Artificial Sequence: Synthetic 

"single operon" nucleotide sequence 
<400> SEQUENCE: 7 

gacgcttttt atcgcaactc tctactgttt ctccataccc gtttttttgg gctagcagga 60 
ggaattcacc atggtacccg ggaggaggat tactatatgc aaacggaaca cgtcatttta 120 
ttgaatgcac agggagttcc cacgggtacg ctggaaaagt atgccgcaca cacggcagac 180 
acccgcttac atctcgcgtt ctccagttgg ctgtttaatg ccaaaggaca attattagtt 240 
acccgccgcg cactgagcaa aaaagcatgg cctggcgtgt ggactaactc ggtttgtggg 300 
cacccacaac tgggagaaag caacgaagac gcagtgatcc gccgttgccg ttatgagctt 360 
ggcgtggaaa ttacgcctcc tgaatctatc tatcctgact ttcgctaccg cgccaccgat 420 
ccgagtggca ttgtggaaaa tgaagtgtgt ccggtatttg ccgcacgcac cactagtgcg 480 
ttacagatca atgatgatga agtgatggat tatcaatggt gtgatttagc agatgtatta 540 
cacggtattg atgccacgcc gtgggcgttc agtccgtgga tggtgatgca ggcgacaaat 600 
cgcgaagcca gaaaacgatt atctgcattt acccagctta aataacccgg ggatcctcta 660 
gagtcgacta ggaggaatat aaaatgaaaa attgtgtcat cgtcagtgcg gtacgtactg 720 
ctatcggtag ttttaacggt tcactcgctt ccaccagcgc catcgacctg ggggcgacag 780 
taattaaagc cgccattgaa cgtgcaaaaa tcgattcaca acacgttgat gaagtgatta 840 
tgggtaacgt gttacaagcc gggctggggc aaaatccggc gcgtcaggca ctgttaaaaa 900 
gcgggctggc agaaacggtg tgcggattca cggtcaataa agtatgtggt tcgggtctta 960 
aaagtgtggc gcttgccgcc caggccattc aggcaggtca ggcgcagagc attgtggcgg 1020 
ggggtatgga aaatatgagt ttagccccct acttactcga tgcaaaagca cgctctggtt 1080 
atcgtcttgg agacggacag gtttatgacg taatcctgcg cgatggcctg atgtgcgcca 1140 
cccatggtta tcatatgggg attaccgccg aaaacgtggc taaagagtac ggaattaccc 1200 
gtgaaatgca ggatgaactg gcgctacatt cacagcgtaa agcggcagcc gcaattgagt 1260 
ccggtgcttt tacagccgaa atcgtcccgg taaatgttgt cactcgaaag aaaaccttcg 1320 
tcttcagtca agacgaattc ccgaaagcga attcaacggc tgaagcgtta ggtgcattgc 1380 
gcccggcctt cgataaagca ggaacagtca ccgctgggaa cgcgtctggt attaacgacg 1440 
gtgctgccgc tctggtgatt atggaagaat ctgcggcgct ggcagcaggc cttacccccc 1500 
tggctcgcat taaaagttat gccagcggtg gcgtgccccc cgcattgatg ggtatggggc 1560 
cagtacctgc cacgcaaaaa gcgttacaac tggcggggct gcaactggcg gatattgatc 1620 
tcattgaggc taatgaagca tttgctgcac agttccttgc cgttgggaaa aacctgggct 1680 
ttgattctga gaaagtgaat gtcaacggcg gggccatcgc gctcgggcat cctatcggtg 1740 
ccagtggtgc tcgtattctg gtcacactat tacatgccat gcaggcacgc gataaaacgc 1800 
tggggctggc aacactgtgc attggcggcg gtcagggaat tgcgatggtg attgaacggt 1860 
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